搜索资源列表
clusty.tar
- 聚类分析的文本组合 压缩文件用于聚类分析的数据处理-cluster analysis of the text composition compressed files for Cluster Analysis of Data Processing
ProgrammingPCollectivePIntelligence
- 本书以机器学习与计算统计为主题背景,专门讲述如何挖掘和分析Web上的数据和资源,如何分析用户体验、市场营销、个人品味等诸多信息,并得出有用的结论,通过复杂的算法来从Web网站获取、收集并分析用户的数据和反馈信息,以便创造新的用户价值和商业价值。全书内容翔实,包括协作过滤技术(实现关联产品推荐功能)、集群数据分析(在大规模数据集中发掘相似的数据子集)、搜索引擎核心技术(爬虫、索引、查询引擎、PageRank算法等)、搜索海量信息并进行分析统计得出结论的优化算法、贝叶斯过滤技术(垃圾邮件过滤、文本过
An-Introduction-to-Parallel-and-Vector-Scientific
- In this text, students of applied mathematics, science and engineering are introduced to fundamental ways of thinking about the broad context of parallelism. The authors begin by giving the reader a deeper understanding of the issues through a genera
SC_demo
- 整理图像特征点提取和分类的程序(可以作为场景分类的前期工作),自己调试过能运行,特征点提取用的SIFT算法,使用K-means聚类算法,将得到的20个聚类中心写入txt文本中-Finishing the image feature point extraction and classification procedures (which can be as the preparatory work of the scene classification), their own debugging
The-programming-collective-wisdom
- 《集体智慧编程》(programming collective intelligence building smart web 2.0 applications)以机器学习与计算统计为主题背景,专门讲述如何挖掘和分析web上的数据和资源,如何分析用户体验、市场营销、个人品味等诸多信息,并得出有用的结论,通过复杂的算法来从web网站获取、收集并分析用户的数据和反馈信息,以便创造新的用户价值和商业价值。全书内容翔实,包括协作过滤技术(实现关联产品推荐功能)、集群数据分析(在大规模数据集中发掘相似的数
hierarcial
- Performs hierarchical cluster based on an graph provided in a text file based on edge adjacency.
vsm
- 用TFIDF方式实现了特征向量空间的建立,将文本文件先进行特征词的准备 再生成特征向量的形式,为接下来的聚类做了准备。用C-The TFIDF way to achieve the establishment of the feature vector space, the preparation of the text file first feature words and then generate the form of feature vectors, and made prepara
textcluster
- Text Clustering, Kmeans Cluster Stop word Handler TermVector TFIDFMeasure Tokeniser
NativeBayes
- Hadoop文本分类算法研究,实现了hadoop集群来对文本进行分类,其中主要算法为朴素贝叶斯分类算法。-Hadoop text classification algorithm to achieve a Hadoop cluster for text classification, feature extraction, weight calculation
DataMiningCluster-master
- 数据挖掘的聚类算法实现 Implementation of text clustering algorithms including K-means, MBSAS, DBSCAN-data mining cluster
wine
- wine数据集,用于聚类分析,关联分析,文本挖掘-wine data set for cluster analysis, correlation analysis, text mining
文本分析聚类实战
- 文本挖掘是从大量的文本数据中抽取隐含的,求和的,可能有用的信息。 通过文本挖掘实现 ?Associate:关联分析,根据同时出现的频率找出关联规则 ?Cluster:将相似的文档(词条)进行聚类 ?Categorize:将文本划分到预先定义的类别里(Text mining is a kind of information that is extracted from a large number of text data, which may be useful. Implementa